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Abstract. To investigate putative sorting domains in 
precursors to polypeptide hormones, we have con- 
structed fusion proteins between the amino terminus of 
preproinsulin (ppl) and the bacterial cytoplasmic en- 
zyme chloramphenicol acetyltransferase (CAT). Our 
aim is to identify sequences in ppl, other than the sig- 
nal peptide, that are necessary to mediate the intracel- 
lular sorting and secretion of the bacterial enzyme. 
Here we describe the in vitro translation of mRNAs 
encoding two chimeric molecules containing 71 and 38 
residues, respectively, of the ppl NH 2 terminus fused 
to the complete CAT sequence. The ppl signal peptide 
and 14 residues of the B-chain were sufficient to direct 
the translocation and segregation of CAT into micro- 
somal membrane vesicles. Furthermore, the CAT 



enzyme underwent N -I inked glycosylation, presumably 
at a single cryptic site, with an efficiency that was 
comparable to that of native glycoproteins synthesized 
in vitro. Partial ammo-terminal sequencing demon- 
strated that the downstream sequences in the fusion 
proteins did not alter the specificity of signal pepti- 
dase, hence cleavage of the ppl signal peptide oc- 
curred at precisely the same site as in the native 
precursor. This is in contrast to results found in 
prokaryotic systems. These data demonstrate that the 
first 38 residues of ppl encode all the information 
necessary for binding to the endoplasmic reticulum 
membrane, translocation, and proteolytic (signal se- 
quence) processing. 



In recent years the process of intracellular protein traf- 
ficking and secretion has been described in detail for 
eukaryotic cells (5) and some of the primary events in 
the secretory pathway have been characterized (26). How- 
ever, the precise molecular mechanisms that regulate intra- 
cellular sorting of secretory proteins are still poorly under- 
stood. Our laboratory is concerned with elucidating putative 
sorting sequences in presecretory proteins, particularly 
those whose secretion is regulated in response to environ- 
mental stimuli. To this end we have been studying the biosyn- 
thesis and posttranslational processing of precursors to the 
pancreatic islet hormones, insulin, glucagon, and somato- 
statin, as models for secretory proteins. Insulin is particu- 
larly appropriate for such studies since it is one of the best 
characterized of all polypeptide hormones, the cDNA and 
gene sequence of preproinsulins (ppl) 1 have been determined 
from numerous species as has the x-ray crystallographic 
structure of the mature hormone (2, 24), In all species, ppl 
is synthesized in pancreatic 0 cells and comprises a signal 
peptide, the B-chain, C-peptide, and A-chain. The signal 



1. Ahhmiatitmx used in this paper: CAT, chloramphenicol acetyltransfer- 
ase; EndoH. endoglycosiduse H; ER, endoplasmic reticulum: pi. proinsu- 
lin; ppl, preproinsulin. 



peptide is cleaved cotransjationally (4, 20) to yield nascent 
proinsulin (pi) in which the B- and A-chains of mature insu- 
lin are joined by the C-peptide, which i» flanked by two pairs 
of basic amino acids. One function of the C-pcptidc is to 
facilitate the correct folding of pi so that the disulfide bridges 
that link the A- and B-chains in mature insulin can form 
efficiently (24). Proteolytic cleavage of pi to insulin occurs 
at the paired basic amino acids by enzyme(s) localized in the 
secretory granules, resulting in secretion of both insulin and 
the C-peptide. Although no topogenic sequences other than 
the signal peptide have been identified in ppl, it is possible 
that the correct folding of pi might be important for efficient 
intracellular transport of the molecule through the secretory 
pathway. Since ppl undergoes only two posttranslational 
modifications, i.e., disulfide bridge formation and proteoly- 
sis, it represents a relatively simple peptide hormone precur- 
sor in which to investigate putative topogenic sequences. 

Using ppl as a model, we have previously demonstrated 
(4) that there is a minimum size (60-70 amino acids) for 
productive interaction of the nascent precursor with signal 
recognition particle and the endoplasmic reticulum (ER) 
membrane. T determine if this minimum size might also 
c nstitute the minimum amount of structural information 
necessary t facilitate translocation of any nascent proteins 
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across the ER membrane, we have used recombinant DNA 
techniques to construct fusion proteins between ppl and the 
bacterial enzyme chloramphenicol acetyltransferase (CAT)* 
Since CAT n rmally resides in die bacterial cytoplasm, the 
aim of these studies is to identity and characterize putative 
sorting domains in ppl and pi that could mediate the intracel- 
lular transport and secretion of a normally nonsecretory pro- 
tein, i.e., the bacterial cytoplasmic enzyme CAT. 

As a first step towards defining putative topogenic deter- 
minants in ppl, we have investigated the in vitro biosynthesis, 
membrane translocation, and cotranslational modifications 
of ppI-CAT fusions. Here we report mat the ppl signal pep- 
tide and a portion of the B-chain can mediate the efficient 
translocation of CAT into mammalian microsomal mem- 
brane vesicles. In addition, a single cryptic N-linked gly- 
cosylation site present in the CAT molecule is apparently 
recognized by the microsomal membranes, such that a glyco- 
sylated form of CAT is synthesized in vitro with an efficiency 
that is comparable to native glycoproteins. 

Materials and Methods 
Materials 

Plasmid pDSS was a gift from Dr. & Dobberstein. European Molecular Bi- 
ology Laboratory. Heidelberg, Federal Republic of Germany. Rabbit 
anti-CAT serum was a gift from Dr. D. Wong (Albert Einstein College of 
Medicine), and guinea pig anti-porcine insulin was purchased from Miles 
Laboratories. Endoglycosidase H was a gift from Dr. P. Atkinson (Albert 
Einstein College of Medicine). |'H|leucine and |"S]cysteine were pur- 
chased from Amersham/Searle Corp., Arlington Heights, IL at the highest 
available specific activity. Escherichia coli RNA polymerase and 7 mCpppA 
were purchased from P L Biochemicals. Inc.. Milwaukee, WI. Restriction 
enzymes were purchased from BRL, Gaithersburg. MD or New England 
Biolabs. Beverly, MA. and used as recommended by the manufacturers. 

Methods 

Construction of an islet cDNA Library. A pancreatic islet cDNA library 
uas constructed from unglcrfish [Lttphius americamts) polyA-containing 
mRNA exactly as described by Gubter and Hoffman (8). Bacterial transfor- 
mants »rre screened for ppl inserts by colony hybridization (6) using a nick- 
tianslated 220-bp PstI fragment of a partial opt cDNA done. pAFMl. 
which was isolated from a previous >:DNA library (Z7). This done had been 
shown to be specific for ppl by hybrid-select translation of anglerfish 
mRNA. Several positive clones were analyzed in detail by restriction map- 
ping and by didcoxy DNA sequencing (IS); one such clone, designated 
p4l3-Il. was found to contain full-length ppl cDNA. 

Construction ofppI-CAT Hybrid Genes. A scheme for the construction 
of the hybrid cDNAs is outlined in Fig. I. Clone p4D-II was digested with 
Pstl. and the 220-bp fragment, encoding the ppl signal peptide. B-chain. 
and a portion of the C-peptide. was purified by polyacrylamidc gel elec- 
trophoresis. The Pst I fragment was blunt-ended by digestion with T4 DNA 
polymerase, and EcoRI linkers (12 bp) were added. I Mg of this Fsti/EcoRI 
fragment to digested with Sail, and the 5' overhang was filled in with the 
Ktenow fragment of DNA polymerase I and ligated to EcoRI linkers (12 bp) 
i Fig. 1 A ). Both the Pstl fragment and the Sail fragment containing EcoRI 



ends were ligated into pDS5 that had been digested with EcoW and alkaline 
phosphat a se (Fig. 1 B). The resulting plasnrids, p5PI£AT and pSSJCAT. 
encode chimeric ppI-CAT fusions designated ppPUCAT and ppSUCAT. 
which have 313 and 280 amino acids, respectively. ppPlJCAT contains me 
ppl signal peptide. B-chain. and 17 residues of the C-chain. as well as 23 
residues from the polylinker of pDSS (25), and the entire CAT sequence 
(Fig. 1 &. top). ppSICAT possesses the ppl signal peptide. 14 amino adds 
of the B-chain, 23 residues of the polylinker, and the entire CAT sequence 
(Fnj. 1 & bottom). Hie predicted amino add sequence encoded by the pDS5 
polylinker in both plasrnids is: Ais-Asn-Ser-Ar^ly-SerAU-Asp-Leu-Gln- 
Pro-Scx-Leu^-Aig-Ptie-Scr^ly^^ 

In Wm> 7hm*CT^pt»Jt. DNA from 
by CsCl cemrifugation and was transcribed in vitro using £. RNA poly- 
merase and the cap analogue 7 mGpppA exactly as described by Stueber 
et al. (25). 

In Vitro Translation. Cell-free translation of in vitro transcribed mRNA 
was performed as previously described (20) using the wheat germ cell-free 
system containing 800 uCi/ml (^cysteine and 1 mCt/ml (*H|leucine. 
The isolation of anglerfish islet mRNA and canine microsomal membranes 
and their use in the wheal germ system was as previously described (4. 20. 
21). Intmun op i ec ipiu tton of the translation products was as described (13) 
with the following modifications: aliqnots of the tra n slation products were 
adjusted to 2% SDS. 4 mM L-cysteine, and incubated at 42°C for S min. 
followed by addition of 5 vol of immunoprtcipitaiion buffer (13). The appro- 
priate antiserum (3 ul anti-CAT, 8 ul anti-insulin) was added, and the sam- 
ples incubated at 4°C overnight. Immunoprecipitates were treated with pro- 
tein A Sepharose and washed four times. The final pellet was resuspended 
in SDS BvGE loading buffer and incubated at 60°C for 3 min, followed by 
alkylation and analysis by SDS MGE on 15% polyacrylamidc gets. 

Assay for Translocation of CAT**ttated Mypeptides Into Stentbrant 
ttricirs. Resistance to posttranslational proteolysis was used to assay for 
the segregation of nascent pl-CAT fusions into microsomal membrane vesi- 
cles. Altquots of the translation products synthesized in the absence and 
presence of microsomal tnembranes were adjusted to 3j6 mM tetracaine and 
digested with 250 ugftnl each of trypsin and chymotrypsin as previously de- 
scribed (21). After incubation for I h at 0°C the digestions were terminated 
by adjusting the samples to 2 mM PMSF and 800 U/ml Irasylol. Samples 
were then prepared for SDS RAGE. 

Endbgfjcoddasv H (EndoH). Glycosytation of pl-CAT fusion* was as- 
sayed by sensitivity to digestion with EndoH. translation products synthe- 
sized in the presence of microsomal membranes were adjusted to \% SDS 
in a total volume of 18 ill and incubated at 95°C for 3 min. 6 ul of 0.1 M 
citrate phosphate buffer. pH 50. were added, followed by I ul of EndoH 
(004 U); I ul of water was added to control samples. Incubations were for 
16-20 h at 37°C; the digestion was terminated by incubating the samples 
at 95°C for 3 min followed by precipitation in 10% cold TCA-containing / 
2 mM cysteine. The TCA pellets were rcsuspended in SDS gel loading 
. buffer and analysed by SDS WGE. ; ' 

Partial NHrteminal Sequencing of the Fusion Proteins. Appropriate 
bands were located by autoradiography, excised from the dried gel. and sub-, 
jected to up to 40 cycles of automated Edman degradation using a spinning; . 
cup sequencer (model 890C; Beckman Instruments Inc.. Fullcrton, CA)- 
as previously described (2a 22), with the following modification. Radiola- 
beled rjolypeptides were dectrophoretically duted as follows: the gel slices 
were itinerated in electrophoresis tank buffer (005 M Iris. 038 M glycine) 
containing 2% SDS; the lehydrated gel pieces were placed in an electro- 
phoretic concentrator elution chamber (model 1750; Isco. Inc.. Lincoln. 
NE) containing 001 M Tris. 0077 M glycine, and 01% SDS: the apparatus 
was filled with decttophorcsts tank buffer containing 0 1 2> SDS. Electroclu- 
tkm was performed at 1 W (constant power) overnight. The eluate (200 ul ) 
was dinned with an equal volume of sterile water, diatyxed briefly against 
water, and loaded into the sp inning cup of the sequencer containing 4 mg 
polybrene and I mg of myoglobin (27). 



Generation of pSPI.CAT and p5SI.CAX. pDSS was digested with EcoRI and dephosphorylaicd with calf intestinal phosphatase (CIP). The 
vector was then ligated with either the P$tl fragment or the Sail fragment, both containing EcoRI cohesive ends, and tte ligation react wn 
used to transform £. coli strain MCiOOO. Positive clones were identified by colony hybridization using the nick-translated ^fragrrtcnt 
of p4D-ll as a probe. Those clones containing either the Pstl or Sail fragment were amplified and plasmid DNA was prepared* The Pj^js 
P 5P1 CAT and pSSI.CAT were subjected to coupled transcription and translation, yielding the two fusion proteins ppPl.CAT and ppSUJU . 
bla. (Mactamase gene; SP, signal peptide; R B-chain; C, C-peptide; P, Pstl; S, Sail. Hatched bar in p4l3-II represents ppl ^^^n« 
open bar represents ppl coding region in the fragments; filled-in region represents EcoRI linkers. L. polylinker rcgwn of pDS5>: Wo. 
coliphage promoter; RBS. prokaryotic ribosome binding site; t„ transcription terminator of the rnnB opemn in £. a>Ii: and h. hcoKi. 
Arrowhead indicates the signal cleavage ,*; triangle indicates the cryptic site for N-linked glycosylation in the CAT protein; double aster- 
isks indicate the paired basic cleavage site Lys-Arg between the B-chain and the C-peptide of ppPLCAT. 
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Results 

Construction of ppI-CAT Fusions 

The c nstruction f the two ppI-CAT fusion proteins is di- 
agrammed in Fig. 1. Plasmid pSPI.CAT encodes a hybrid 
protein f 313 amino acids comprising the anglerfish ppl sig- 
nal peptide (24 residues), the complete B-chain (30 resi- 
dues), the first 17 residues of the connecting peptide (C-pep- 
tide), a 23 amino acid peptide encoded by the linker region 
of pDS5 (see Materials and Methods), and the complete CAT 
sequence (219 amino acids). Plasmid pSSI.CAT encodes a 
hybrid protein of 280 amino acids possessing the complete 
ppl signal peptide, the first 14 residues of the insulin B-chain, 
the same 23 residue polypeptide encoded by the linker region 
of pDS5, and the complete CAT sequence. In addition, both 
plasmids encode a cryptic site for N-linked glycosylation 
(Asn-Gln-Thr) present at residues 34 through 36 of the native 
CAT molecule; this sequence starts at position 128 in 
ppPLCAT and at residue 95 in ppSLCAT. The structure of 
these plasmids was confirmed by detailed restriction digests, 
DNA sequencing, and partial amino acid sequence analysis 
(Fig. 5) of the translation products. 

In Vitro Biosynthesis of Fusion Proteins 
Initially we determined if the NHrterminus of ppl could di- 
rect the translocation of CAT into mammalian microsomal 
membranes. Tb this end, RNA transcribed in vitro from 
plasmids P 5PI.CAT and p5SI.CAT was translated in the 
wheat germ cell-free system in the absence and presence of 
microsomal membranes (Fig. 2). In the absence of micro- 

A 

12 3 4 5 6 7 8 12 3 4 




s mes the translation products encoded by both plasmids 
P5PI.CAT and p5SI.CAT were significantly larger than an- 
glerfish ppl or native CAT (Fig. 2 A, lanes 7, 5, J, and 7) 
and were of the expected size for the two predicted fusi n 
proteins, i.e., Af, 31j000 for ppPLCAT and M T 28j000 for 
ppSLCAT. T confirm that these products were fusi n pro- 
teins between ppl and CAT, the translation products were 
treated with antibodies directed against porcine insulin or 
CAT (Fig. 2, B and C, lanes / and J, respectively). Both 
ppPLCAT and ppSLCAT were immunoprecipitated with 
anti-insulin and anti-CAT antibodies, indicating they had the 
predicted antigenic determinants. Some cross-reactivity be- 
tween the anti-insulin serum and the CAT translation prod- 
ucts generated from transcription of the parent vector pDS5 
was noted (Fig. 2 B t lanes 5 and 6). This cross-reactivity, 
which was variable with different batches of antisera, is most 
likely due to the presence of trace levels of endogenous CAT 
antigen, synthesized by R subtilis, a component of Freund's 
complete adjuvant used for the initial immunization of the 
animals. Consequently, the final serum may contain a low 
level of CAT antibodies in addition to anti-insulin antibodies. 
Both ppl and pi (Fig. 2 B, lanes 7and 8) were efficiently rec- 
ognized by the anti-insulin antibody, while, as expected, the 
anti-CAT antibodies showed no cross-reactivity with ppl or 
pi (Fig. 2 C, lanes 7 and*). 

In the presence of microsomal membranes, nascent ppl 
was cotranslationally cleaved to pi (Fig. 2 &, lanes 7and 8; 
reference 20). Surprisingly, the translation products from 
ppPLCAT and ppSLCAT mRNA synthesized in the presence 
of membranes were processed to two forms. One form was 
of slightly foster mobility (Fig. 2, A % B % and C; lanes 2 and 
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Ftgurv 3. Segregation of ppI-CAT fusions. Plasmids p5PI.CAT and pSSI.CAT were transcribed in vitro and the RNA translated in the ab- 
sence <-) and presence ( +) of 3 AWml microsomal membranes. After translation, aliquots were adjusted to 10 mM CaCI;. 3 mM tetra- 
caine and treated with either protease (T/C 250 |ig/ml each of trypsin and chymotrypsin) or with protease in the presence of 1% Triton 
X-100 (TX). Samples were incubated for I h at 4°C and then treated with 2 mM PMSF and prepared for SDS PAGE. {A ) Products of 
pSPI.CAT. (&) Products of pSSI.CAT. Lane /, products synthesized in the absence of membranes: lane 2, products synthesized in the pres- 
ence of membranes; lanes 3 and 4, as lanes / and 2 but treated with trypsin and chymotrypsin; lanes 5 and 6\ as 3 and 4 except that proteolysis 
was performed in the presence of I % Triton X-HJOt Urge arrowhead (left of lanes A I and B I ), ppPLCAT and ppSICAT. respectively. 
Lanes 2 and 4. downward and upward pointing arrowheads, protease resistant forms of fusion protein: lane 2, asterisks, residual ppPLCAT 
and ppSI.CAT. respectively. 



4 % lower arrow) and was presumably the fusion protein minus 
its signal peptide. The second form of processed fusion pro- 
tein migrated more slowly than the precursor (Fig. 2, A, B, 
and C; lanes 2 and 4, downward pointing arrows). No such 
processed forms of CAT were seen when native CAT. en- 
coded by pDS5, was synthesized in the absence or presence 
of microsomal membranes (Fig. 2, A and C lanes 5 and 6*). 
The translation products from pDS5 appeared to migrate as 
a doublet on SDS gels; the reason for this is unclear. How- 
ever, the appearance of this doublet was unaffected by the 



presence or absence of microsomal membranes, indicating 
that it is not due to the incorporation of CAT into microsomes. 

Segregation of the Fusion Proteins 

To further analyze the nature of the fusion proteins synthe- 
sized in the presence of microsomal membranes, the trans- 
lation products were assayed for translocation into the 
microsomal vesicles by determination of their sensitivity to 
protease digesti n (Fig. 3). Aliquots of the translation prod- 
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of fusion protein. (Upivard pointing arrows) nonglycosylated forms of fusion protein lacking its signal peptide *™ 



ucts were treated with a mixture of trypsin and chymotrypsin 
in the absence and presence of the detergent Triton X-100. 
Fusion proteins ppPLCAT and ppSI.CAT synthesized in the 
absence of membranes were completely sensitive to protease 
treatment (Fig. 3, A and B, lane 3) indicating that these pro- 
teins were not intrinsically protease-resistant. In contrast, 
the two putative processed forms of each fusion protein, 
pPI.CAT and pSI.CAT, synthesized in the presence of mem- 
branes, were protease-resistant (Fig. 3, A and B % lane 4) t in- 
dicating that they were shielded by the membrane bilayer. 
This was confirmed when proteolysis was performed in the 
presence f Triton X-100. In this case, these products were 
c rnpletely digested (lane 6*). These data indicate.that the 



two processed forms of each fusion protein were completely 
segregated into the cistemae of the microsomal vesicles. 

Glycosylation of ppI-C AT Fusions 

The appearance of a slower migrating form of processed fu- 
sion proteins, resulting from synthesis in the presence of 
microsomal membranes, was similar to that seen for numer- 
ous glycoproteins synthesized in vitro (e.g., VSV-G protein 
(13)). We therefore hypothesized that the molecules of slower 
electrophoretic mobility were glycosylated forms of the fu- 
sion proteins, whereas those migrating faster than ppPI.GAT 
or ppSLCAT c rresponded to processed molecules in which 
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the insulin signal peptide was cleaved but which were not 
glycosylated. Although neither ppl n r CAT are normally 
glycosylated in vivo, CAT does encode a cryptic recognition 
site for N-linked glycosylation: Asn-Gly Thr, at residues 
34-36 of the native m lecule. This site appears at residues 
128-00 in ppPLCAT and residue 95-97 in ppSLCAT. Since 
ppl sequences in both fusions mediated translocation of CAT 
into microsomal vesicles, it is possible that this site was rec- 
ognized by the glycosylation enzymes in the microsomal 
membranes. The prediction that the higher molecular weight 
forms of the fusion proteins (Fig. 4, lanes 2 and 6) were gly- 
coproteins was tested by subjecting the translation products 
to treatment with EndoH. Since high mannose core oligosac- 
charides added to nascent glycoproteins in the ER can be 
cleaved by EndoH digestion, the glycosylated fusion proteins 
should be sensitive to this enzyme. EndoH digestion was per- 
formed on translation products generated from pSPLCAT 
and pSSLCAT synthesized in the presence of microsomal 
membranes, and the samples were analyzed fay SDS PAGE 
(Fig. 4). In the presence of EndoH (lanes 3 and 7), the 
slower migrating species of both fusions (upper arrows) was 
quantitatively converted to a form which then co-migrated 
with the faster migrating species observed in the presence of 
membranes (lanes 2 and 6\ lower arrows); Lc. , after removal 
of the single N-linked carbohydrate chain, the resulting prod- 
uct co-migrated with a protein that represented the fusion 
protein minus its putative signal peptide. 

During the course of these experiments, we noted that both 
proteolytic processing of the signal peptide and glycosyla- 
tion of pPLCAT and pSLCAT were particularly efficient, as 
judged by autoradiographic intensity (Figs. 3 and 4). lb quan- 
titate the relative efficiencies of proteolytic processing and 
glycosylation, gel bands corresponding to both the glycosyl- 
ated and unglycosylated forms of each fusion protein were 
excised from the dried gels, solubilized, and their radioactiv- 
ity determined directly (Table I). In several experiments in 
which 43% of nascent ppl was processed to pi, the fusion 
proteins were processed with an efficiency of 65-70%. Simi- 
larly, glycosylation of the fusion proteins was particularly 
efficient, *v45% of pPLCAT and 68% of pSLCAT were 
glycosylated. These values arc comparable to the efficiency 
of glycosylation of native glycoproteins synthesized under 
these conditions (data not shown). 



Table L Efficiency of Processing and Glycosylation 
of ppi-CAT Fusions 



Protein 



Processing* 



Glycosylation* 



43.4 (5) 

66.5 (3) 

69.6 (3) 



NA 

45.6 (3) 
68.0 (3) 



PPl 

ppPI.CAT 
ppSI.CAT 

The appropriate polypeptides were excised from the dried gel <Fig. 4) solubi- 
lized in 30* H r O : (4), ami the radioactivity determined by liquid scintillation 
counting. Numbers in parenthesis represent total number of experiments. NA. 
not applicable. 

cpm in processed forms (upper and lower ! ^ |Q0 
(cpm in precursor + cpm in processed forms) 

* Glycosylation 



' Processing 



cpm in upper processed form ^ ^ 



(cpm in lower + cpm in upper forms) 



RfftM NHrterminal Sequencing of the Fbsum Proteins 

The antibody precipitati n data (Fig. 2) indicated that both 
ppPLCAT and ppSLCAT were fusion proteins between ppl 
and CAT. However, since the anti-insulin antibodies had 
some cross-reactivity with authentic CAT, it was necessary 
to unequivocally demonstrate that the fusion proteins were 
indeed those predicted. We therefore subjected ppPICAT 
and ppSLCAT to partial NHrterminal sentencing. Previous 
studies (9, 22) had shown that leucine residues were present 
at positions 3, 5, 10, 12, 13, and 14 of the signal peptide, and 
leucine and cysteine were present at positions 30 and 31, 
respectively, of ppl (corresponding to residues 7 and 8 of the 
insulin B-chain). Consequently, ppPLCAT and ppSLCAT 
were synthesized in the presence of [ s H)leucine and 
("S]cysteine and electrophoresed on 15% polyacrylamide 
gels. The appropriate bands were localized by autoradiogra- 
phy, the proteins eluted and subjected to microsequencing 
(Fig. 5, A and B). Leucine and cysteine residues were found 
at the expected positions; these results not only confirm the 
accuracy of the antibody data but conclusively demonstrate 
that the fusion proteins contained the ppl signal peptide. 

It is possible that foreign sequences downstream from the 
signal peptide might influence the site of signal peptidase 
cleavage (1). Therefore it was of interest to determine if both 
of the precursor fusion proteins were correctly cleaved by 
signal peptidase, particularly since ppSI CAT contained only 
fourteen residues of the insulin B-chain. The polypeptides 
corresponding to the glycosylated forms of pPLCAT and 
pSLCAT (Fig. 4, upper band) synthesized in the presence 
of [ 3 H]leucine, [ M S]cysteine and microsomal membranes, 
were eluted from gels and also subjected to microsequenc- 
ing, (Fig. 5, CandD). Previously, it had been shown that 
cleavage of the signal peptide occurred between residues 24 
and 25 of anglerfish ppl (20). Consequently, if ppPLCAT 
were accurately cleaved by signal peptidase, leucine residues 
would be present at positions 7, 12, 16, and 18, and cysteine 
at residues 8 and 2a The data (Fig. 5 C) show that this was 
the case and is consistent with correct cleavage of the signal 
peptide. The sequence data from the glycosylated form of, 
pSLCAT (Fig. 5 D) also demonstrated leucine residues at > 
positions 7 and 12 and a cysteine residue at position & In 
addition, novel leucine residues were found at positions 23 
. and 27. These leucines correspond to those predicted from 
DNA sequencing of the linker region from pDS5. Most im- 
portantly, the data demonstrate that the signal peptide of 
ppSLCAT was also cleaved at the same position as in die na- 
tive precursor, even though the fusion protein contains only 
fourteen residues of the insulin B-chain. Thus, our results 
demonstrate that the downstream sequences do not influence 
die site of cleavage of the ppl signal peptide. 

Discussion 

Most small polypeptide hormones (less than ~50 amino 
acids) are synthesized as part of a larger precursor molecule; 
in some cases the precursor may also be a polyprotein con- 
taining repeating units of the same peptide or several differ- 
ent hormones (3). We are attempting to decode putative sort- 
ing information that may be present in the proregkms of 
a variety f diverse peptide hormone precursors. To this 
end, we have synthesized chimeric genes encoding variable 
amounts of the NH r terminus of ppl fused to CAT. a bac- 
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Figure 5. P&rtial NH 2 -terminal sequence of ppI-CAT and pI-CAT fusions. Plasmids p5PI.€AT and pSSI.CAT were transcribed and trans- 
lated in the wheat germ system containing both ["SJcysteine and [ 3 H]Ieucine (1 mCi/ml of each) in the absence and . presence of 
microsomal membranes and the translation products resolved by SDS PAGE. After autoradiography, bands corresponding to ppPI.CAT 
(A ), ppSI.CAT (B). the glycosylated forms of pPI.CAT (C), and pSI.CAT (D) were excised and the polypeptides electrophoretically etuted. 
The eluted samples were applied to a sequencer (model 890C; Beckman Instruments, Inc.) and subjected to automated Edman degradation. 
(Solid line) [*H]leucine; (dashed line) ("SJcysteine; (asterisks) known or predicted leucine and cysteine residues that were confirmed in 
this analysis; (arrowhead; A, C, and D) site of signal peptide cleavage (20). (Heavy black line; D) indicates novel amino acids encoded 
by the linker region of pDS5 (see Materials and Methods). Single letter code: A, Ala; C Cys; D, Asp; F, Phe; G, Gly; H, His; L, Leu; 
M, Met; N, Asn; P, Pro; Q. Glu; R, Arg; S, Ser; V, Val; W, Trp; Y, Tyr. 



terial cytoplasmic enzyme. ;The rationale for these experi- 
ments is to identify sequence information within ppl that 
could mediate the sequestration of CAT molecules into the 
secretory pathway, perhaps ultimately leading to its secre- 
tion. Since we had shown (4) that ppl interacts with the ER 
when about half the molecule has been synthesized, we 
postulated that sorting information, in the first 60 residues 
of ppl, which includes the NHrterrninal signal peptide, 
should be sufficient to effect translocation of any protein 
across the ER membrane. 

To test this hypothesis directly, we have constructed two 
fusion proteins. One, ppPI.CAT, contained the first 71 resi- 
dues of ppl, including the complete signal peptide and E-chain 
and part of the C-peptide fused to CAT. The other fusion pro- 
tein, ppSI.CAT, possessed only 38 amino acids f ppl (the 
signal peptide plus 14 residues f the B-chain) fused to CAT. 



It is noteworthy that the poly linker sequence, present in both 
fusion proteins, contained seven charged residues; conse- 
quently, it might be expected that these could interfere with 
membrane translocation. However, both fusions, ppPI.CAT 
and ppSI.CAT, were capable of targeting to microsomal 
membranes, as well as translocating through the lipid 
bilayer, suggesting that local charge effects per se may not 
necessarily inhibit translocation. Since the relative efficien- 
cies of glycosylation and signal peptide cleavage were virtu- 
ally identical in the two constructions (Table I), it is possible 
that ppSI.CAT, as well as ppPI.CAT, contains all the neces- 
sary structural domains to effect efficient translocation of for- 
eign proteins into the lumen of the ER. This result was some- 
what surprising, since a construction of 45 residues, which 
encodes only the first 38 amin acids of ppl and no CAT se- 
quences, bound poorly to microsomal membranes and was 
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inc mpeteni for transl cati n(Eskridge, E., and D. Shields, 
manuscript submitted for publication). These results suggest 
that, providing the 38 Nrfc-terminal residues f ppl can as- 
sume an appropriate conformati n, by virtue of being fused 
to foreign sequences, in this case CAT, targeting and translo- 
cati n domains in the nascent precursor can be recognized 
by the ER translocation machinery. 

Our data contrast with results seen from an analogous con- 
struction in prokaryotes. When the signal peptide and the 
first IS amino acids of LamB were fused to die LacZ gene 
and expressed in £ co/i, ^galactosidase remained in the 
cytoplasm, suggesting that additional LamB sequences are 
necessary to direct insertion of the hybrid protein into the in- 
ner membrane (IS). Similarly, when the P-lactamase signal 
peptide was fused to chicken triosephosphate isomerase (11) 
only 30% of the fusion protein was targeted to the membrane 
in vivo; no signal peptide cleavage or translocation across the 
membrane was observed. At present we do not know if the 
insulin signal sequence alone is sufficient to mediate translo- 
cation of CAT into microsomal vesicles of if die fourteen 
residues of the B-chain are also required; these experiments 
are currently in progress. 

It is noteworthy that the downstream sequences in both fu- 
sion proteins had no influence on the specificity of cleavage 
by signal peptidase. This was particularly striking in the case 
of ppSLCAT, which contained only 14 amino acids of the 
B-chain fused to 242 foreign residues. Nevertheless, cleav- 
age of the ppl signal peptide occurred at precisely the same 
site as in the native precursor, i.e., between Ala 24 and Vab; 
this was also the case for the ppPLCAT fusion. These results 
contrast with recent data on the signal peptide cleavage of 
Staphylococcus aureus protein A (1). In this case, when an 
internal IgG-binding fragment of protein A was inserted im- 
mediately adjacent to the signal sequence (replacing the nor- 
mal sequence), incorrect cleavage of the signal peptide was 
observed and transport into the periplasm was significantly 
less efficient than for wild-type protein A (1). These data sug- 
gest that in prokaryotic cells, at least gram-positive bacteria, 
the structure of the polypeptide chain distal, to the site of 
signal cleavage may affect proteolytic processing by signal 
peptidase. 

Several experiments have demonstrated that prokaryotic 
signal sequences, e.g. , £. coli ^-lactamase, can be efficiently 
recognized by the eukaryotic translocation apparatus, such 
that proteolytically processed P-lactamase was sequestered 
into mammalian microsomal membranes in vitro (12, 16). 
Lingappa et ah (14) also showed that the signal peptide and 
the first five amino acids of P-lactamase were sufficient to 
effect translocation of normally cytoplasmic globin chains 
into the lumen of microsomal membranes. In contrast to 
studies on prokaryotic cells (IS), these results indicated that 
relatively little structural information other than the signal 
sequence may be needed to effect translocation of a protein 
across the ER membrane, indeed, very recent experiments 
(17) suggest that a precise fusion of the ^-lactamase signal 
peptide to appropriately engineered a-globin chains is suf- 
ficient to mediate translocation into the ER vesicles. Our 
results demonstrate that a eukaryotic signal sequence medi- 
ates translocation of a prokaryotic cytoplasmic protein across 
mammalian membranes. As such, this sh uld enable us to 
distinguish between topogenic sequences sufficient for trans- 
locati n into the ER lumen from those putative domains 



needed to mediate distal sorting events in the eukaryotic 
secretory pathway. 

Two recent reports have also demonstrated cryptic gryco- 
sylati n of n rmally nonglycosylated proteins (19, 23). 
Spiess and Lodish (23) showed that fusion of the mem- 
brane-anchor domain of the asialoglycoprotein receptor to 
rat a-tubulin was sufficient to mediate translocation and gly- 
cosylation of normally cytoplasmic tubulin by microsomal 
membranes in vitro. Sharma et al. (I?) constructed a chi- 
meric gene, comprising the influenza virus hemagglutinin 
signal peptide and SV40 large T antigen. Expression of this 
gene in 3T3 cells resulted in T antigen being exclusively 
localized to the ER, where both signal peptide cleavage and 
core oligosaccharide addition occurred. However, T antigen 
remained in the ER and was not transported through the 
secretory pathway. In this context, the ppI-CAT fusions de- 
scribed here, as well as several under construction, offer an 
excellent experimental system in which to investigate the 
function of the pi B-chain and C-peptides, as well as re- 
linked glycosylation, on the secretion of the CAT enzyme 
from several different cell types. In particular it will be of 
interest to determine if glycosylation plays a role in either the 
translocation of CAT molecules from die ER to the Golgi, 
c.f. yeast proalpha factor (10), or in post-Golgi sorting events 
(7). These experiments are currently in progress. 
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